whisper : add support for large v3 #1444
I cannot push to HuggingFace - any idea what is wrong?
git push
batch response: Authorization error. B | 0 B/s
Uploading LFS objects: 0% (0/2), 0 B | 0 B/s, done.
error: failed to push some refs to 'https://huggingface.co/ggerganov/whisper.cpp'
I have created a write access token and have used
Edit: this fixed the issue: https://discuss.huggingface.co/t/cant-push-to-new-space/35319/24
Pushing the new model to https://huggingface.co/ggerganov/whisper.cpp
Thanks for quickly supporting the v3 model!
I found the same: v3 repeats more than v2, even on audio preprocessed with VAD. @arabcoders also mentioned in the official whisper repo that it duplicates more in Japanese. Again, thank you for the quick support for the v3 model and for keeping v2 as well. This is the only whisper project that works like a charm on my Mac Studio with Metal/MPS and even CoreML enabled!
* whisper : add support for large v3
* bench : fix build + fix go bindings
* bench : fix n_mels
* models : update readme
Awesome, thanks for the quick support ;)
Also, in my first tests I found that v3 large repeats or hallucinates a LOT more than v2. Not sure it was a good idea to make the v3 model the default large model, at least not without the obviously needed changes to mitigate the new repetitions.
NOTE: re-download ggml-large.bin to get the v3 version
ggml-large.bin is the new v3 model
ggml-large-v2.bin is the old v2 model
This should be ready to merge.
I did some anecdotal tests using the audio samples in this repo, and it seems like v3 tends to repeat some lines more than v2. Could be a problem on the whisper.cpp side, though I ran one of the audio samples with the OG whisper and it repeats in a similar way.
Anyway, we can't draw any conclusions based on this single case, so I will merge this for now and see what people report.
Edit: ran one more example with the OG whisper and this one even produces wrong characters (starting at 01:27.220). Not sure if I'm doing something wrong - it would be helpful if people can confirm this.
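For anyone who wants to compare v2 and v3 with something more than eyeballing, here is a rough sketch of one way to count repeated lines in a transcript. It is not part of whisper.cpp; the function names and the sample text are made up for illustration, and it assumes the transcript uses bracketed timestamp lines of the form `[start --> end]  text` (with or without an hours field). It only counts a segment as a repeat when its text matches the segment right before it, so it will miss looser kinds of hallucination.

```python
import re
from itertools import groupby

# Assumed line format: "[00:00:00.000 --> 00:00:02.000]   text"
# The hours field is optional, so "[01:27.220 --> 01:29.000] text" also matches.
_STAMP = r"(?:\d{2}:)?\d{2}:\d{2}\.\d{3}"
TIMESTAMP_PREFIX = re.compile(rf"^\[{_STAMP} --> {_STAMP}\]\s*")


def segment_texts(transcript: str) -> list[str]:
    """Strip timestamps and return the normalized text of each non-empty segment."""
    texts = []
    for line in transcript.splitlines():
        text = TIMESTAMP_PREFIX.sub("", line.strip()).strip().lower()
        if text:
            texts.append(text)
    return texts


def count_consecutive_repeats(transcript: str) -> int:
    """Count segments whose text is identical to the segment just before them."""
    runs = groupby(segment_texts(transcript))
    return sum(len(list(run)) - 1 for _, run in runs)


if __name__ == "__main__":
    # Hypothetical sample, not real model output.
    sample = (
        "[00:00.000 --> 00:02.000]  Hello there.\n"
        "[00:02.000 --> 00:04.000]  Hello there.\n"
        "[00:04.000 --> 00:06.000]  Hello there.\n"
        "[00:00:06.000 --> 00:00:08.000]   Goodbye.\n"
    )
    print(count_consecutive_repeats(sample))  # prints 2
```

Running this over the v2 and v3 output for the same audio file gives two numbers to compare, which may make reports here easier to line up against each other.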